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DESCRIPTION 

The VOTRAX SC-02 is a versatile, high-quality, phoneme- 
based speech synthesizer circuit contained in a single 
monolithic CMOS integrated circuit. It is designed to produce 
an audio output of unlimited vocabulary, music and sound 
effects at an extremely low data input rate. 

Speech is synthesized by combining phonemes, the building 
blocks of speech, in an appropriate sequence. The VOTRAX 
SC-02 contains five eight-bit registers that allow software con- 
trol of speech rate, pitch, pitch movement rate, amplitude, 
articulation rate, vocal tract filter response, and phoneme 
selection and duration. 



FEATURES 

• Single low-power CMOS integrated circuit 

• 5 volt supply 

• Extremely low data rate 

• 8-bit bus compatible with selectable handshaking 
modes 

• Non-dedicated speech, ideal for text-to-speech or 
phonetic programming 

• Programmable and hard powerdown/reset modes 

• Switched-capacitor-filter technology 
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Votrax SC-02 Operation Description 

This DATA SHEET is intended to provide VOTRAX SC- 
02 feature and capability information only. Refer to the 

VOTRAX SC-02 USERS GUIDE for complete 
information on application and phonetic programming. 

The Production of Speech 

To produce different speech phonemes (sounds) the 
VOTRAX SC-02 uses a model of the human vocal tract. 
Within the device this analog tract is modeled with five 
cascaded programmable low pass filter sections. The 
filter sections are programmed internally by a digital 
controller. Either a glottal (pitch) or a pseudo-random 
noise source is used to excite the vocal tract, depending 
on whether a voiced or non-voiced phoneme is 
selected. During speech production the phonemes will 
typically last between 25mS and 100 mS. 

The Speech Attribute Registers 

Speech is produced by programming the speech 
attribute (characteristic) data into five eight-bit registers. 
These internal registers allow selection of phonemes 
and speech characteristics. Refer to the Register Input 
Formats for the functional allocations. 

Device Response to Attribute Register Data 

The VOTRAX SC-02 has two general classes of 
attribute data: "control" data (speech rate, filter 
frequency, phoneme articulation rate, phoneme 
duration, immediate inflection setting, and inflection 
movement rate) and "target" data (phoneme selection, 
audio amplitude, and transitioned inflection). The 
VOTRAX SC-02 responds immediately upon loading 
"control" data. When loading "target" data, the device 
will begin to move towards that target at the prescribed 
transition rates. This fully internal linear transitioning 
between target values, done in a manner similar to 
normal speech, is a key factor in reducing control data 
rate without sacrificing speech quality. 

Attribute Register Writing 

The eight bit data bus D7-D0 loads the particular 
attribute register selected by the three bit address bus 
RS2-RS0. To write the data, R/W (Read/Write), CSO 
(Chip Select 0), and CS1 pins must first be in the 0,1 ,0 
state, respectively. The data is then written when at 
least one of these pins changes state. Refer to the Write 
Timing Diagram. Writing is accomplished by changing 
preferably CSO or CS1 . Following device power up, 
nominal values should be loaded into the attribute 
registers as described in the following sections. 

Approximate Data Transfer Rate 

For speech production using the VOTRAX SC-02, the 
actual data rate depends on the amount of speech 
attribute manipulation. For example, the production of 
monotonic speech, where phoneme and duration are 
the only attribute manipulations, requires a data rate 
less than 100 bits-per-second. A higher rate of about 
500 bits-per-second is required for high quality speech 
due to the associated full attribute manipulation. 



Selectable Operation Modes 

The state of the Duration/Phoneme Register bits DR1 
and DRO determine the operating mode of the device 
when the Control bit (CTL) is changed from a logic one 
to a logic zero. The four modes of operation include 
choice of timing response between "frame" or 
"phoneme" timing (as explained below), transitioned or 
immediate inflection response, and setting the A/R 
(Acknowledge/Request Not) pin active or disabled. 
Refer to the Mode Selection Chart for further 
information. 

Phoneme Selection 

The VOTRAX SC-02 can produce the 64 phonemes 
listed on the Phoneme Chart. Bits P5-P0 are used for 
phoneme selection. The relative phoneme duration is 
set by bits DR1 and DRO. 

Phoneme Articulation Adjustment 

A particular phoneme is produced by the combination of 
the vocal-tract resonator filter settings, excitation 
source type, and source amplitude. When a new 
phoneme is selected, the device performs a linear 
transition to the new set of characteristics. The rate of 
this transition is controlled by the articulation setting, 
bits TR2-TR0. This rate is relative in that articulation is 
not affected by speech rate bits R3-R0. A typical 
articulation register setting is "5". 

Programming Inflection (Pitch) 

When the VOTRAX SC-02 is in the mode of immediate 
inflection, bits I1 1 -10 provide immediate adjustment with 
seven octaves of base pitch on an even tempered scale. 
With the device in the transitioned inflection mode, bits 
II 0-16 select the target pitch and bits 15-13 determine the 
inflection rate of change. Bits I1 1 , 12, II , and 10 always 
provide immediate adjustment. A typical value used for 
speech production is 120 HZ for male speech and 225 
HZ for female speech where: 

Inflection Frequency = XCK Frequency 

8 X (4096-1) 

I = decimal equivilent of Inflection Register setting 
Filter Frequency Setting 

Data bits FF7-FF0 set clock frequency for the switched- 
capacitor vocal tract filters. This determines overall filter 
frequency scaling, producing a vocal tract of the desired 
"length". Inflection pitch is not affected by these bits. 
Typically this is set to give a clock frequency of about 
20KHz (see formula below), but may be manipulated to 
fine-tune speech quality or to change "voicetype", male, 
female, child, etc. 

Filter Frequency = XCK frequency 
2 (256 - FF) 

FF = decimal equivalent to the Filter Frequency 
Register setting 

Speech Rate 

Rate of speech is controlled by bits R3-R0, the Speech 



Rate Register. In Frame Timing Mode, new attribute 
data is requested at the end of a "frame" where: 

Frame Duration = ^096 X (16-R) 

XCK frequency 

R = decimal equivalent of Rate Register setting 
In the Phoneme Timing Mode the frame duration is 
modified by the phoneme duration bits DR1 and DRO 
where: 

Phoneme Duration = (Frame Duration) X (4-D) 
D = decimal equivalent of Duration Register setting 
All internal attribute transitioning is performed relative to the 
Speech Rate Register setting. Speech rate does not affect 
inflection or filter frequency. A typical rate setting is 
hexadecimal "A". 

Amplitude Adjustment 

The overall Audio Output level is set with register bits 
A3-A0. Since each phoneme has a preset amplitude 
relative to other phonemes, it is not necessary to pro- 
gram the amplitude of each phoneme; however, ampli- 
tude changes may be used to enhance the speech 
quality and add emphasis. Amplitude is transitioned 
linearly at rate dependent on the phoneme duration 
setting. A typical amplitude setting is hexadecimal "C". 

Control Bit and Power Down Mode 

Setting the Control bit (CTL) to a logic one puts the device into 
Power Down Mode, a sort of "standby" condition. This bit is 
also set high when the PD/RST pin is brought low and also 
upon power up. The Power Down mode turns off the 
excitation sources and analog circuits to reduce power 
consumption, but maintains the present register settings. 
Upon a Control bit logic one-to-zero transition, the present 
settings of DR1 and DRO determine the operation mode as 
described in the Mode Selection Chart. 

Register Reading 

Device pin D7 becomes an output, as the inverted state 
of A/R, when the device is put into Read (RA/V is a logic 
1 and the chip is selected, CS1 = 0, CSO = 1). Refer to 
the Read Timing Diagram. The register address bits are 
ignored. 

Time Base 

Many different time bases may be utilized (see external 
clock input specifications). It is desirable to establish a 
stable crystal controlled time base from 800 to 
lOOOKHz when DIV2 is set low, or twice the frequency 
when DIV2 is set high. A good time base can be easily 
accomplished with an inexpensive colorburst 3.5795 
MHz crystal in conjunction with a divide-by-two circuit. 
The actual device timing and output frequencies are 
directly related to the time base frequency used. 

Microprocessor Interfacing 

Either the A/R line, or D7 as an output, are used as an 
interrupt to indicate when the duration of a frame or 
phoneme has been exceeded. No detectable degrada- 
tion to speech quality results when several millisec- 
onds occur between data request and load. 



PHONEME CHART 


Hex Code* 


Phoneme Symbol 


Example Word (or Usage) 


00 


PA 


(pause) 


01 


E 


MEET 


02 


El 


BENT 


03 


Y 


BEFORE 


04 


Yl 


YEAR 


05 


AY 


PLEASE 


06 


IE 


ANY 


07 


1 


SIX 


08 


A 


MADE 


09 


Al 


CARE 


OA 


EH 


NEST 


OB 


EH1 


BELT 


OC 


AE 


DAD 


OD 


AE1 


AFTER 


OE 


AH 


GOT 


OF 


AH1 


FATHER 


10 


AW 


OFFICE 


11 





STORE 


12 


OU 


BOAT 


13 


GO 


LOOK 


14 


lU 


YOU 


15 


IU1 


COULD 


16 


U 


TUNE 


17 


U1 


CARTOON 


18 


UH 


WONDER 


19 


UH1 


LOVE 


1A 


UH2 


WHAT 


IB 


UH3 


NUT 


10 


ER 


BIRD 


ID 


R 


ROOF 


1E 


R1 


RUG 


IF 


R2 


MUTTER (German) 


20 


L 


LIFT 


21 


LI 


PLAY 


22 


LP 


FALL (final) 


23 


W 


yVATER 


24 


B 


BAG 


25 


D 


PAID 


26 


KV 


TAG 


27 


P 


PEN 


28 


T 


TART 


29 


K 


KIT 


2A 


HV 


(hold vocal) 


2B 


HVC 


(hold vocal closure) 


20 


HF 


HEART 


2D 


HFC 


(hold fricative closure) 


2E 


HN 


(hold nasal) 


2F 


Z 


ZERO 


30 


s 


SAME 


31 


J 


MEASURE 


32 


SCH 


SHIP 


33 


V 


VERY 


34 


F 


FOUR 


35 


THV 


IHERE 


36 


TH 


WIIH 


37 


M 


MORE 


38 


N 


NINE 


39 


NG 


RANG 


3A 


:A 


MARCH EN (German) 


38 


:GH 


LOWE (French) 


30 


:U 


FUNF (German) 


3D 


:UH 


MENU (French) 


3E 


E2 


BITTE (German) 


3F 


LB 


LUBE 



*Note — Hex codes shown with DRO, DR1 = (longest Duration) 
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VOTRAX SC-02 PIN ASSIGNMENT DESCRIPTIONS 



Pin No. 


Symbol 


Active 
Level 


Description 




1 


AO 




Analog Audio Output biased 
@ VdD/2 requires an 
external audio amp for 
speaker drive 


2 


AGND 




Analog Ground 


3 


TP1 




Do not use 


4 


A/R 




Acknowledge/Request Not 
— open collector output 
changes from high to low 
level after phoneme is 
generated. May be used as 

o n i ntprri int rpni ip*^t for npw 

phoneme data. (See Pin 17 
also.) 


5 


TP2 




Do not use 


6 


RS2 




Register Select Input - used 
to select one of five i nternal 
registers in conjunction with 
RSI and RSO 


7 


RSI 




Register Select (See pin 6) 


8 


RSO 




Register Select (See pin 6) 


9 


DO 




LSB of 8-bit data bus — 
input only 


10 


D1 




Data Input (only) 


11 


D2 




Data Input (only) 


12 


DGND 




Digital Ground 


13 


D3 




Data Input (only) 



PinNo. 


Symbol 


Active 
Level 


Description 


14 


D4 




Data Input (only) 


15 


D5 




Data Input (only) 


16 


D6 




Data Input (only) 


17 


D7 




MSBof 8-bit data bus. Bi- 
directional, inverse of pin 4 
when read is high 


18 


PD/RST 


Low 


Power Down Control Input — 
Silences audio.output and 
retains DC bias without 
disturbing register contents. 
Disables A/R output. 


19 


CSO 


High 


Chip Select Input 


20 


CS1 


Low 


Chip Select Input 


21 


R/W 




Read/Write Control Input — 
Write is active low for load- 
ing internal registers. Read is 
active high but enables D7 
only. 


22 


XCK 




Clock Input (^ 1 or 2 MHz) 


23 


DIV2 


High 


Clock Divide by Two — used 
when external clock is csf 
2 MHz 


24 


VDD 




Positive Voltage Supply 



REGISTER INPUT FORMATS 



Register Address 


Register Name 


Bus Input Bit Position 


RS2 


RS1 


RSO 




D7 


06 


D5 


D4 


D3 


02 


01 


00 


LO 


LO 


LO 


Duration/Phoneme (DR/P) 


DR1 


DRO 


P5 


P4 


P3 


P2 


PI 


PO 


LO 


LO 


HI 


Inflection (1) 


no 


19 


18 


17 


16 


15 


14 


13 


LO 


HI 


LO 


Rate/Inflection (R/l) 


R3 


R2 


R1 


RO 


111 


12 


11 


10 


LO 


HI 


HI 


Control/Articulation/Amplitude (C/A/A) 


CTL 


T2 


T1 


TO 


A3 


A2 


A1 


AO 


HI 


X 


X 


Filter Frequency (F) 


F7 


F6 


F5 


F4 


F3 


F2 


F1 


FO 



DR1, DRO . . Define the phoneme duration. 
P5 -*-P0 . . . Address the phoneme required. 
I1 1— 10 ... . Define inflection target frequencies 

and rate of change. 
R3-^R0 . . . Define the rate or speed of speech. 
CTL Define the mode of A/R response in 

conjunction with DR1 and DRO. 

Also directly set by PD/RST. 



T2— TO .... Define the rate of movement of the formant 
position for articulation purposes. 

A3— AO . . . Define the amplitude of the output audio. 

F7— 'FO . . . Define the frequency of all vocal tract 
filters. 



WRITE TIMING DIAGRAM 



READ TIMING DIAGRAM 
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•Valid data latched on first rise or fall of RAA/, CSO or CS1 into inactive. 



Timing Characteristics (Vpo = 4.5 to 5.5 Volts, TA =-40 to +85 deg. C) 



Item 


Symbol 


Limits 


Units. 


Min. 


Max. 


Data Setup Time 


TS 






nsec 


Data Hold Time 


TH 






nsec 


Strobe Width 


TWS 


200 




nsec 


ReadAA/rite Cycle Time 


TRW 


2.25* 




jLisec 


Rise/Fall Time 


TE 




100 


nsec 


D7 Output Access Time 


TACO 




180 


nsec 


D7 Output Hold Time 


THR 




180 


nsec 



Notes: * Based on color burst frequency. 

Timing relative to deselect by either CSO, CS1, or R/W changing. 



MODE SELECTION CHART 



DR1 


DRO 


*CTL' BIT 


Function 


HI 


HI 


HI— LO 


A/R active; phoneme timing response; transitioned inflection (most 
commonly used mode) 


HI 


LO 


HI— LO 


A/R active; phoneme timing response; immediate inflection 


LO 


HI 


HI-LO 


A/R active; frame timing response; immediate Inflection 


LO 


LO 


HI— LO 


Disables A/R output only; does not change previous A/R response 



ABSOLUTE MAXIMUM RATINGS 



Item 


Symbol 


Limit 


Units 


Supply Voltage 


vdd— vss 


7.0 


V 


Input Voltage 


V|N 


-0.5 to Vdd + 0.5 


V 


D.C. Current at Inputs 


•INM 


±1.0 


mA 


Storage Temperature 


Ts 


-55 to +125 


X 


Operating Temperature 


Ta 


-40 to + 85 


X 


Power Dissipation 


Pd 


500 


mW 
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Electrical Characteristics Unless otherwise specified, 4.5 < Vdd < 5.5; —40 deg. C ^ TA < 85 deg. C; 

1.50MHz ^XCK frequency <2.0MHz, when XCK/2 = logic 1 or 
0.75MHz < XCK frequency < 1 .OMHz, when XCK/2 = logic 



Description 


Conditions 


Min. 


Typ. 


Max. 


Units 


POWER SUPPLY 


Supply Current 


PD/RST=1, CTL = 




8 


20 


mA 


Supply Current | PD/RST = 0, CTL= 1 




7 


18 


nnA 



AUDIO OUTPUT 



Output Level 


AW phoneme 

RL = 50Kohm to GND through cap. 


0.28VDD 


0.37VDD 


0.50VDD 


Vpp 


DC Output Offset 




0.5VDD 


0.6VDD 


0.7VDD 


V 


Resistive Loading 


AC coupled to AO to GND 


10 






Kohm 


Capacitive Loading 


To GND to ensure Stable A 






100 


pF 



Description 


Conditions 


Symbol 


Min 


Typ 


Max 


Units 


BUS CONTROL INPUTS, DATA INPUTS (RSO, RS1, RS2, CSO, CS1 , D0-D7 PD/RST) 


Input High Voltage 




V|H 


VsS + 2.4 




Vdd + 0.3 


VDC 


Input Low Voltage 




V|L 


—0.3 




+ 0.8 


VDC 


Input Leakage Current 


V|N = o to Vdd 


l|N 






5 


/xA 


Input Capacitance 


V|N =0 Ta = 25°C 
measured at f = 1.0MHz 


C|N 






10 


pF 


Input Capacitance, D7 Input 




C|N(D7) 






20 


pF 


Input Current, D7 in 
TRI-State "OFF" State 


V|N = 0.4 to 2.4 V 


l|N(TS) 




2.0 


5.0 




D7 OUTPUT 


D7 Output Low Voltage 


'Load = 0-4 mA into D7 


V0L(D7) 






0.4 


VDC 


D7 Output High Voltage 


•Load = 205 /xA out of D7 


V0H(D7) 




VdD-2.0 




VDC 


A/R OUTPUT 


Output Low Voltage 


Il = 3.2 mA into A/R 


IOL(A/R) 






0.4 


VDC 


Output High Leakage Current 


vout = o.o to Vdd 


lL(A/R) 






10 


fiA 


Output Capacitance 


VOut = VDC TamB = 25X 
f = 1.0 MHz 


Cout(A/R) 




15 


pF 




DIV2 INPUT 


Input Low Voltage 




V|L(DIV2) 


-0.3 




■2 Vdd 


V 


Input High Voltage 




V|H(DIV2) 


■8Vdd 




Vdd + 0.3 


V 


Input Leakage 


ViN = o to Vdd 








5 


/xA 



Description 


Conditions 


Symbol 


Min. 


Typ. 


Max. 


Units. 


XCLK 


Input Low Voltage 




V|H(IC) 


—0.3 




+ 0.8 


v 


Input High Voltage 




V|H(IC) 


2.4 




Vdd + 0.3 


v 


Input Current 


V|N= 0.0 to Vdd 


l|N(C) 






5 




Input Capacitance 




C|N(C) 






10 


PF 


Duty Cycle 




D(XCLK) 


0.4 




0.6 





TYPICAL MICROPROCESSOR IMPLEMENTATION 



ROM 



I/O PORTS 




02 = 1.0MHz 



TYPICAL CRYSTAL DRIVEN VOTRAX SC-02 HARDWARE IMPLEMENTATION 



+ 5V 
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REGISTER SELECT 
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RS2 . 
RS1 - 
RSO ■ 
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DO 
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D6 
07 



WRITE STROBE 



AUDIO 
AMP IN 



SL 



10 



11 



12 



AO 

AGND 

A/R 



VDD 
DIV2 
XCK 
R/W 
CS1 

RS2 CSO 
RSI PD/RST 



23| 
22 



RSO 
DO 
D1 
D2 

DGND 



D7 
D6 
D5 
D4 
D3 



24 



■O +5V 



330MF 
6V 



21 



18 



-O + 5V 



17 



16 



15 



14 



13 



+ 5V 




CD4013 



CD4069 




3.579545MHZ 
NTSC COLORBURST 
XTAL 



+ 5V TO CD4013,PIN 14 
CD4069,PIN 14 

GND TO CD4013,PIN 7 
CD4069,PIN 7 



LOUD 
SPEAKER 



No responsibility is assumed by Votrax for use of this product 
nor for any infringements of patents and trademarks or other 
rights of third parties resulting from its use. No license is 



granted under any patents, patent rights or trademarks of 
Votrax. Votrax reserves the right to make changes in 
specifications at any time and without notice. 
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